Logarithmic Time Online Multiclass prediction

نویسندگان

  • Anna Choromanska
  • John Langford
چکیده

We study the problem of multiclass classification with an extremely large numberof classes (k), with the goal of obtaining train and test time complexity logarith-mic in the number of classes. We develop top-down tree construction approachesfor constructing logarithmic depth trees. On the theoretical front, we formulate anew objective function, which is optimized at each node of the tree and createsdynamic partitions of the data which are both pure (in terms of class labels) andbalanced. We demonstrate that under favorable conditions, we can construct loga-rithmic depth trees that have leaves with low label entropy. However, the objectivefunction at the nodes is challenging to optimize computationally. We address theempirical problem with a new online decision tree construction procedure. Exper-iments demonstrate that this online algorithm quickly achieves improvement intest error compared to more common logarithmic training time approaches, whichmakes it a plausible method in computationally constrained large-k applications.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Logarithmic Time One-Against-Some

We create a new online reduction of multiclass classification to binary classification for which training and prediction time scale logarithmically with the number of classes. We show that several simple techniques give rise to an algorithm that can compete with one-against-all in both space and predictive power while offering exponential improvements in speed when the number of classes is large.

متن کامل

Log-time and Log-space Extreme Classification

We present LTLS, a technique for multiclass and multilabel prediction that can perform training and inference in logarithmic time and space. LTLS embeds large classification problems into simple structured prediction problems and relies on efficient dynamic programming algorithms for inference. We train LTLS with stochastic gradient descent on a number of multiclass and multilabel datasets and ...

متن کامل

The price of bandit information in multiclass online classification

We consider two scenarios of multiclass online learning of a hypothesis class H ⊆ Y X . In the full information scenario, the learner is exposed to instances together with their labels. In the bandit scenario, the true label is not exposed, but rather an indication whether the learner’s prediction is correct or not. We show that the ratio between the error rates in the two scenarios is at most ...

متن کامل

Efficient Online Multiclass Prediction on Graphs via Surrogate Losses

We develop computationally efficient algorithms for online multi-class prediction. Our construction is based on carefully-chosen data-dependent surrogate loss functions, and the new methods enjoy strong mistake bound guarantees. To illustrate the technique, we study the combinatorial problem of node classification and develop a prediction strategy that is linear-time per round. In contrast, the...

متن کامل

Efficient Online Bandit Multiclass Learning with Õ(√T) Regret

We present an efficient second-order algorithm with Õ( 1 η √ T )1 regret for the bandit online multiclass problem. The regret bound holds simultaneously with respect to a family of loss functions parameterized by η, for a range of η restricted by the norm of the competitor. The family of loss functions ranges from hinge loss (η = 0) to squared hinge loss (η = 1). This provides a solution to the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015